DeepSeek LLM
https://github.com/deepseek-ai/DeepSeek-LLM DeepSeek-LLM
https://arxiv.org/abs/2401.02954 DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
https://gyazo.com/c8cc96c47fdfa4099b990cc8bda44508
https://gyazo.com/fefb221dc683be74e3b15fa765bced2a
Comparison with Llama-2-70b
https://huggingface.co/collections/deepseek-ai/deepseek-llm-65f2964ad8a0a29fe39b71d8 deepseek-ai/deepseek llm
https://huggingface.co/deepseek-ai/deepseek-llm-7b-base deepseek-ai/deepseek-llm-7b-base
https://huggingface.co/deepseek-ai/deepseek-llm-7b-chat deepseek-ai/deepseek-llm-7b-chat
https://huggingface.co/deepseek-ai/deepseek-llm-67b-base deepseek-ai/deepseek-llm-67b-base
https://huggingface.co/deepseek-ai/deepseek-llm-67b-chat deepseek-ai/deepseek-llm-67b-chat